Adding readManyByPartitionKey API#48801

Open
FabianMeiswinkel wants to merge 39 commits into main from users/fabianm/readManyByPK

Conversation

@FabianMeiswinkel
Member

@FabianMeiswinkel FabianMeiswinkel commented Apr 13, 2026

Description

Adds a new readManyByPartitionKey API surface to the Java Cosmos SDK (sync + async) and wires it through the Spark connector to support PK-only reads (including partial HPK), with query-plan-based validation for custom queries.

Changes:

  • Added public readManyByPartitionKey overloads in CosmosAsyncContainer / CosmosContainer and an internal AsyncDocumentClient + RxDocumentClientImpl implementation that groups PKs by physical partition and issues per-range queries.
  • Introduced ReadManyByPartitionKeyQueryHelper to compose PK filters into user-provided SQL and added a new config knob for per-partition batching.
  • Added Spark support (UDF + PK serialization/parsing helper + reader) and unit/integration tests for query composition and end-to-end behavior.
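The filter-composition idea behind ReadManyByPartitionKeyQueryHelper can be sketched roughly as follows. This is an illustrative stand-in only: the real helper is an internal SDK class, and the method name composePkFilter and its parameters are hypothetical; the @__rmPk_ parameter prefix is taken from the review discussion below.

```java
// Illustrative sketch only: composePkFilter and its parameter names are
// hypothetical, not the internal ReadManyByPartitionKeyQueryHelper API.
public class PkFilterComposeSketch {

    // Appends a partition-key equality filter to a user-provided query,
    // reusing an existing WHERE clause when one is present.
    static String composePkFilter(String userQuery, String alias, String pkProperty, String paramName) {
        String filter = alias + "." + pkProperty + " = " + paramName;
        if (userQuery.toUpperCase().contains(" WHERE ")) {
            return userQuery + " AND " + filter;
        }
        return userQuery + " WHERE " + filter;
    }

    public static void main(String[] args) {
        System.out.println(composePkFilter("SELECT * FROM c", "c", "pk", "@__rmPk_0"));
        // SELECT * FROM c WHERE c.pk = @__rmPk_0
        System.out.println(composePkFilter("SELECT * FROM c WHERE c.type = 'order'", "c", "pk", "@__rmPk_0"));
        // SELECT * FROM c WHERE c.type = 'order' AND c.pk = @__rmPk_0
    }
}
```

A real implementation would use parameterized SqlQuerySpec values rather than string concatenation; the sketch only shows where the PK filter lands relative to an existing WHERE clause.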

All SDK Contribution checklist:

  • The pull request does not introduce breaking changes.
  • CHANGELOG is updated for new features, bug fixes or other significant changes.
  • I have read the contribution guidelines.

General Guidelines and Best Practices

  • Title of the pull request is clear and informative.
  • There are a small number of commits, each of which has an informative message. This means that previously merged commits do not appear in the history of the PR. For more information on cleaning up the commits in your PR, see this page.

Testing Guidelines

  • Pull request includes test coverage for the included changes.

Copilot AI review requested due to automatic review settings April 13, 2026 23:00
@FabianMeiswinkel FabianMeiswinkel marked this pull request as draft April 13, 2026 23:01
Contributor

Copilot AI left a comment


Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

Adds a new readManyByPartitionKey API surface to the Java Cosmos SDK (sync + async) and wires it through the Spark connector to support PK-only reads (including partial HPK), with query-plan-based validation for custom queries.

Changes:

  • Added public readManyByPartitionKey overloads in CosmosAsyncContainer / CosmosContainer and an internal AsyncDocumentClient + RxDocumentClientImpl implementation that groups PKs by physical partition and issues per-range queries.
  • Introduced ReadManyByPartitionKeyQueryHelper to compose PK filters into user-provided SQL and added a new config knob for per-partition batching.
  • Added Spark support (UDF + PK serialization/parsing helper + reader) and unit/integration tests for query composition and end-to-end behavior.

Reviewed changes

Copilot reviewed 18 out of 18 changed files in this pull request and generated 11 comments.

File Description
sdk/cosmos/docs/readManyByPartitionKey-design.md Design doc describing the new API, query validation, and Spark integration approach.
sdk/cosmos/azure-cosmos/src/main/java/com/azure/cosmos/implementation/query/DocumentQueryExecutionContextFactory.java Adds a helper method to fetch query plans through the gateway for validation.
sdk/cosmos/azure-cosmos/src/main/java/com/azure/cosmos/implementation/RxDocumentClientImpl.java Implements readManyByPartitionKey execution, validation, PK→range grouping, batching, and concurrency.
sdk/cosmos/azure-cosmos/src/main/java/com/azure/cosmos/implementation/ReadManyByPartitionKeyQueryHelper.java New helper to build SqlQuerySpec by appending PK filters and extracting table aliases.
sdk/cosmos/azure-cosmos/src/main/java/com/azure/cosmos/implementation/Configs.java Adds config/env accessors for max PKs per per-partition query batch.
sdk/cosmos/azure-cosmos/src/main/java/com/azure/cosmos/implementation/AsyncDocumentClient.java Adds internal interface method for PK-only read-many.
sdk/cosmos/azure-cosmos/src/main/java/com/azure/cosmos/CosmosContainer.java Adds sync readManyByPartitionKey overloads.
sdk/cosmos/azure-cosmos/src/main/java/com/azure/cosmos/CosmosAsyncContainer.java Adds async readManyByPartitionKey overloads and wiring to internal client.
sdk/cosmos/azure-cosmos-tests/src/test/java/com/azure/cosmos/implementation/ReadManyByPartitionKeyQueryHelperTest.java Unit tests for SQL generation, alias extraction, and WHERE detection.
sdk/cosmos/azure-cosmos-tests/src/test/java/com/azure/cosmos/ReadManyByPartitionKeyTest.java Emulator integration tests for single PK + HPK, partial HPK, projections, and query validation.
sdk/cosmos/azure-cosmos-spark_3/src/test/scala/com/azure/cosmos/spark/ItemsPartitionReaderWithReadManyByPartitionKeyITest.scala Spark integration test for reading by PKs and empty result behavior.
sdk/cosmos/azure-cosmos-spark_3/src/test/scala/com/azure/cosmos/spark/CosmosPartitionKeyHelperSpec.scala Unit tests for PK string serialization/parsing helpers.
sdk/cosmos/azure-cosmos-spark_3/src/main/scala/com/azure/cosmos/spark/udf/GetCosmosPartitionKeyValue.scala Spark UDF to compute _partitionKeyIdentity values.
sdk/cosmos/azure-cosmos-spark_3/src/main/scala/com/azure/cosmos/spark/ItemsPartitionReaderWithReadManyByPartitionKey.scala Spark partition reader that calls new SDK API and converts results to rows.
sdk/cosmos/azure-cosmos-spark_3/src/main/scala/com/azure/cosmos/spark/CosmosReadManyByPartitionKeyReader.scala Spark reader that maps input rows to PKs and streams results via the partition reader.
sdk/cosmos/azure-cosmos-spark_3/src/main/scala/com/azure/cosmos/spark/CosmosPartitionKeyHelper.scala Helper for PK serialization/parsing used by the UDF and data source.
sdk/cosmos/azure-cosmos-spark_3/src/main/scala/com/azure/cosmos/spark/CosmosItemsDataSource.scala Adds Spark entry point to read-many by partition key, including PK extraction logic.
sdk/cosmos/azure-cosmos-spark_3/src/main/scala/com/azure/cosmos/spark/CosmosConstants.scala Adds _partitionKeyIdentity constant.
Comments suppressed due to low confidence (1)

sdk/cosmos/azure-cosmos-spark_3/src/main/scala/com/azure/cosmos/spark/ItemsPartitionReaderWithReadManyByPartitionKey.scala:1

  • The error message has mismatched parentheses/quoting (classOf<SparkRowItem])) which makes it harder to read and search for. Suggest correcting it to a clean, unambiguous string (e.g., classOf[SparkRowItem]) to improve diagnosability.

Comment thread sdk/cosmos/docs/readManyByPartitionKey-design.md
FabianMeiswinkel and others added 18 commits April 14, 2026 16:49
…s/spark/ItemsPartitionReaderWithReadManyByPartitionKey.scala

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
…s/spark/ItemsPartitionReaderWithReadManyByPartitionKey.scala

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
…ntation/RxDocumentClientImpl.java

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
…ntation/ReadManyByPartitionKeyQueryHelper.java

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
@FabianMeiswinkel FabianMeiswinkel marked this pull request as ready for review April 16, 2026 21:57
@FabianMeiswinkel
Member Author

@sdkReviewAgent

@FabianMeiswinkel
Member Author

@sdkReviewAgent

Comment thread sdk/cosmos/docs/readManyByPartitionKey-design.md Outdated
@xinlian12
Member

Review complete (44:59)

Posted 9 inline comment(s).

Steps: ✓ context, correctness, cross-sdk, design, history, past-prs, synthesis, test-coverage

@FabianMeiswinkel
Member Author

@sdkReviewAgent

@FabianMeiswinkel
Member Author

/azp run java - cosmos - spark

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).


Map<PartitionKeyRange, SqlQuerySpec> rangeQueryMap = new HashMap<>();
List<String> partitionKeySelectors = createPkSelectors(partitionKeyDefinition);
List<String> partitionKeySelectors = ReadManyByPartitionKeyQueryHelper.createPkSelectors(partitionKeyDefinition);
Member


🟢 Observation: createPkSelectors refactoring changes behavior for nested PK paths in existing APIs

The old private createPkSelectors used StringUtils.substring(pathPart, 1) which treated /address/city as a single segment producing ["address/city"]. The new implementation uses PathParser.getPathParts which correctly splits it into ["address", "city"].

This is a bug fix for nested partition key paths (the old selector looked for a property literally named address/city rather than traversing nested objects), but it changes behavior for two existing code paths:

  1. getRangeQueryMap → used by readMany(List<CosmosItemIdentity>)
  2. createLogicalPartitionScanQuerySpec → used by readAllItemsOfLogicalPartition

The fix is correct and the risk is low (nested PK paths are uncommon, and the old behavior was wrong), but it may be worth noting in the CHANGELOG since it changes existing readMany behavior.
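The behavioral difference described above can be sketched with the two splitting strategies side by side (a simplified stand-in; the real code goes through StringUtils and PathParser rather than these hypothetical helpers):

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class PkSelectorSketch {

    // Old behavior (simplified): strip the leading '/' and keep the rest
    // as a single segment, so "/address/city" becomes one property name.
    static List<String> oldPathParts(String pkPath) {
        return Arrays.asList(pkPath.substring(1));
    }

    // New behavior (simplified): split on '/' so nested paths are
    // traversed as nested object properties.
    static List<String> newPathParts(String pkPath) {
        List<String> parts = new ArrayList<>();
        for (String part : pkPath.split("/")) {
            if (!part.isEmpty()) {
                parts.add(part);
            }
        }
        return parts;
    }

    public static void main(String[] args) {
        System.out.println(oldPathParts("/address/city")); // [address/city]
        System.out.println(newPathParts("/address/city")); // [address, city]
    }
}
```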

⚠️ AI-generated review — may be incorrect. Agree? → resolve the conversation. Disagree? → reply with your reasoning.

throw new IllegalArgumentException(
"Custom query for readMany by partition key must not contain LIMIT.");
}
if (queryInfo.hasNonStreamingOrderBy()) {
Member


🟡 Remaining: SELECT TOP N queries not rejected by validation

OFFSET and LIMIT were added to the rejection list (fixing the earlier comment), but queryInfo.hasTop() is still missing. SELECT TOP 5 * FROM c would pass validation and the SDK would split it across N physical partitions × M batches, each independently limiting to 5 rows — returning up to 5 × N × M results instead of the expected 5.

hasTop() is available on QueryInfo (line 71 of QueryInfo.java) and is used elsewhere in the codebase (e.g., DocumentQueryExecutionContextFactory:294).

Suggested fix: Add before the hasNonStreamingOrderBy() check:

if (queryInfo.hasTop()) {
    throw new IllegalArgumentException(
        "Custom query for readMany by partition key must not contain TOP.");
}
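The multiplication effect described above can be demonstrated with a small simulation (illustrative only; the names here are hypothetical and the real SDK fans queries out via reactive streams, not lists):

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch of why an unrejected TOP is dangerous here: each
// per-partition batch applies TOP n independently, so concatenating the
// batches can yield up to n * batchCount rows instead of n overall.
public class TopFanOutSketch {

    static List<Integer> fanOutWithTop(List<List<Integer>> perBatchRows, int n) {
        List<Integer> combined = new ArrayList<>();
        for (List<Integer> batch : perBatchRows) {
            combined.addAll(batch.subList(0, Math.min(n, batch.size())));
        }
        return combined;
    }

    public static void main(String[] args) {
        List<List<Integer>> batches = new ArrayList<>();
        for (int p = 0; p < 3; p++) {          // 3 physical partitions
            List<Integer> rows = new ArrayList<>();
            for (int i = 0; i < 10; i++) {     // 10 matching rows each
                rows.add(p * 100 + i);
            }
            batches.add(rows);
        }
        // "TOP 5" applied per batch returns 3 * 5 = 15 rows, not 5.
        System.out.println(fanOutWithTop(batches, 5).size()); // 15
    }
}
```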

⚠️ AI-generated review — may be incorrect. Agree? → resolve the conversation. Disagree? → reply with your reasoning.

}

return UtilBridgeInternal.createCosmosPagedFlux(
readManyByPartitionKeyInternalFunc(partitionKeys, customQuery, requestOptions, classType));
Member


Review Summary: readManyByPartitionKey API

Overall assessment: This is a well-structured, comprehensive addition to the Cosmos SDK. The PR author has been very responsive to feedback — most of the 56 existing review comments have been addressed with fixes.

Key issues resolved since initial review

  • ✅ StaleResourceRetryPolicy wrapper added for stale cache resilience
  • ✅ PartitionKey.NONE NPE handled via effectivePkInternal fallback
  • ✅ End-to-end timeout policy now applied to Spark reader
  • ✅ feedResponseProcessedListener diagnostics callback added
  • ✅ SQL parser now handles escaped single quotes ('')
  • ✅ OFFSET/LIMIT validation added to custom query checks
  • ✅ Parameter name collision avoided with @__rmPk_ prefix
  • ✅ Batch size default/doc mismatch aligned
  • ✅ Null handling made configurable (Null vs None semantics)
  • ✅ Empty PK list short-circuit in Spark reader
  • ✅ Distributed tracing confirmed via CosmosPagedFlux infrastructure + test coverage

Remaining items (2 new inline comments posted)

  1. 🟡 SELECT TOP N not rejected — OFFSET/LIMIT were added but hasTop() is still missing, which could produce semantically incorrect results (per-batch limiting instead of global)
  2. 🟢 createPkSelectors refactoring — The move to PathParser.getPathParts fixes nested PK path handling but silently changes behavior for existing readMany and readAllItemsOfLogicalPartition APIs (low risk, but worth a CHANGELOG note)

Architecture highlights

  • Good use of round-robin interleaving across physical partitions for concurrent execution
  • LinkedHashMap for deterministic iteration order is a nice touch
  • The TransientIOErrorsRetryingReadManyByPartitionKeyIterator with page-committed tracking is a solid retry strategy
  • PK deduplication via canonical JSON representation handles type coercion correctly
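The round-robin interleaving highlighted above can be sketched as follows. This is a simplified, synchronous stand-in for what the SDK does with reactive per-partition streams; the class and method names are illustrative:

```java
import java.util.ArrayList;
import java.util.List;

public class RoundRobinSketch {

    // Interleaves per-partition result lists one element at a time, so no
    // single physical partition's results dominate the head of the merged
    // stream; shorter partitions simply drop out of later rounds.
    static <T> List<T> interleave(List<List<T>> perPartitionResults) {
        List<T> merged = new ArrayList<>();
        int longest = 0;
        for (List<T> partition : perPartitionResults) {
            longest = Math.max(longest, partition.size());
        }
        for (int i = 0; i < longest; i++) {
            for (List<T> partition : perPartitionResults) {
                if (i < partition.size()) {
                    merged.add(partition.get(i));
                }
            }
        }
        return merged;
    }

    public static void main(String[] args) {
        List<List<String>> parts = List.of(
            List.of("a1", "a2", "a3"),
            List.of("b1"),
            List.of("c1", "c2"));
        System.out.println(interleave(parts)); // [a1, b1, c1, a2, c2, a3]
    }
}
```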

⚠️ AI-generated review — may be incorrect. Agree? → resolve the conversation. Disagree? → reply with your reasoning.

@xinlian12
Member

Review complete (04:06)

Posted 3 inline comment(s).

Steps: ✓ context, correctness, cross-sdk, design, history, past-prs, synthesis, test-coverage

